Systems and methods for localized image registration and fusion

ABSTRACT

Systems and methods are described for co-registering, displaying and quantifying images from numerous different medical modalities, such as CT, MRI and SPECT. In this novel approach co-registration and image fusion is based on multiple user-defined Regions-of-Interest (ROI), which may be subsets of entire image volumes, from multiple modalities, where the each ROI may depict data from different image modalities. The user-selected ROI of a first image modality may be superposed over or blended with the corresponding ROI of a second image modality, and the entire second image may be displayed with either the superposed or blended ROI.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention is generally directed to medical imaging. More particularly, the present invention is directed to advanced image fusion systems and methods for use in image-assisted biopsy, image-assisted treatment planning, image-assisted treatment and image-assisted surgery, and molecular imaging.

2. Description of the Related Art

Nuclear medicine imaging tools, such as single-photon-emission-computed-tomography (SPECT) and positron-emission-tomography (PET) are known. Other types of medical imaging tools, such as magnetic resonance imaging (MRI), computed tomography (CT) or ultrasound (US), are also well known. Each of these known imaging techniques (modalities) provides a clinician with a different type of data and is useful for a different purpose, such as mainly functional (SPECT, PET) or mainly anatomical imaging (MR, CT, US). Until recently, images from only two modalities were combined in one display image. Combining these images usually requires two processes: 1) co-registration and 2) Image Fusion.

A medical imaging procedure typically creates a volume or matrix of measured values that forms the basis for that modality's image of the patient. Each value in the matrix represents image data at the geometric center of a voxel (volume element) of the volume. Co-registration is a process where the voxels in each of the images obtained by different modalities are transformed to correspond to a common reference frame. This can be done either via a rigid body or non-rigid body (“elastic”) transformation of the pixel positions. In a rigid body transformation, only scaling, translations, and rotations are allowed, keeping the relative positions between all voxels constant. In an elastic transformation, additional to the fore-mentioned rigid body transformation, the voxel positions may also be elastically deformed, and relative distance between all voxels in the input image positions does not have to be preserved.

Image Fusion (IF) is a technique to display co-registered images obtained from two different imaging devices. IF can be characterized by the following: Let I_(i) denote an image volume of modality M_(i). In a previous step, separate image volumes from different devices are co-registered, in order to have image values at common pixel positions {right arrow over (r_(j))}={right arrow over (r_(j) ¹)}={right arrow over (r_(j) ²)},∀j, with pixel index j within the coregistered volume. In the IF technique, the fused image I₃ of images I₁ and I₂ is displayed on a display device with the display-pixel color and intensity determined from a function of the coregistered image values of I₁ and I₂. The function is often, but not limited to a linear combination of the image pixel value in each image (“alpha-blending”), which can be easily mathematically represented by: I _(i) =I _(i)({right arrow over (r_(j))})∀j; I ₃ =c ₁ I ₁ +c ₂ I ₂;   (Eq. 1) In general: I₃=f(I₁, I₂), where f may be any function combining the 2 images, however all pixels are involved.

By this technique, the images from two different devices or modalities (1 and 2) are displayed simultaneously in the same region in space, if the volumes are accurately co-registered. With current techniques, the entire image volumes of two images are fused using constant coefficients c₁ and c₂. The color table lookup index, T, is derived from I₃.

Current techniques are known to work well if images with similar resolution and noise characteristics are fused, or if the image information is equally distributed throughout the image volume in all images to be fused. However, if the resolution and noise characteristics are quite different, as is the case, for example, with functional and anatomical imaging, such as SPECT and CT images or MRI images, then the information delivered to an observer can be less than optimal, and more difficult to interpret with a simple IF technique. For example, referring to FIG. 1, the SPECT image of a prostate is fused with a CT image of the same prostate by the current IF technique. One can see that the critical information for the prostate in the center of the resulting image 100 is hard to read, because the entire SPECT image with its high noise and low spatial resolution is superimposed over the low-noise and high-spatial resolution CT image. In this example, the information of interest in the SPECT image is a small region of focal uptake (“Region of Interest”) corresponding to the functioning of the prostate, while the anatomical relevant information in the CT image is generally distributed throughout the entire image slice.

Current methods for combining images also fail to address the need of multi-modality imaging for quantitative use, treatment planning and monitoring, for systems with which more than two modality volumes could be registered, such as Ultrasound or CT, and SPECT/PET.

Another method to display co-registered images, shown in FIG. 3, provides two images from different devices or different modalities side by side. Here, areas of the images can be correlated by manipulating a correlated cursor 302, which points to the same region on each view. This method is also not optimal because the images are not overlaid at all, and the clinician is forced to make assumptions and estimations by visually comparing two separate images, with only the correlated cursor as an aid.

Accordingly, there is a need for new and improved systems and methods of combining image volumes from disparate imaging devices, to enhance clinical results and/or image-assisted biopsy, image-assisted treatment planning, image-assisted treatment and image-assisted surgery, and molecular imaging.

SUMMARY OF THE INVENTION

The present invention provides new and improved systems and methods for co-registering, displaying and quantifying images from different medical devices or imaging modalities. By blending or replacing planar Regions-of-Interest (ROI) and/or Volumes-of-Interest (VOI) from different imaging devices, a novel display with improved utility can be generated. The size and location of the ROI or VOI is determined by the type of imaging modality and the clinical user. Any number of different modalities may be combined into a single image such that the clinician can obtain the most useful image data in a single unified image. By registering and fusing selected ROIs and/or VOIs, rather than entire image volumes, processing may be reduced and real-time or near real-time images may be obtained.

In one embodiment of this novel approach, co-registration and image fusion are based on multiple (i.e., at least two) user-defined ROIs or VOIs. These ROIs or VOIs would be smaller than the entire coregistered volume, but larger than just a few pixels. The ROIs/NOIs of the object data would be coregistered with the information in the target data. This embodiment would be faster than classical coregistration or more accurate than Landmark registration. Classical co-registration is done by using information from the largest possible VOIs from each volume. Each voxel of one image modality is correlated to a corresponding voxel in another image modality, and using all possible voxels. Landmark co-registration is another well-known method, where the landmarks are the smallest possible ROIs, namely individual voxels. In Landmark co-registration, a user correlates a few points on each image modality that the user deems to relate to the same structure, and the whole image volume then can be co-registered by applying the transformation matrix that is derived from the selected landmark points. Beneficiaries are multi-modality fusion applications of diagnostic imaging, treatment planning, and monitoring.

In accordance with one embodiment of the present invention, a system for localized image fusion is provided. The system includes an image fusion unit configured to receive first and second imaging data respectively from first and second imaging devices. The unit is capable of registering the first imaging data with the second imaging data and blending a first region-of-interest (being a subset of the first imaging data) with a second region-of-interest (being a subset of the second imaging data). The first and second regions-of-interest correspond (e.g., relate to the same body part, etc.). The system may display a fused image comprising the first or second imaging data and the blended first and second regions-of-interest.

Further applications and advantages of various embodiments of the present invention are discussed below with reference to the drawing figures.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is an image of a prior art fused image;

FIG. 2 is an image based in one modality with a region-of-interest displayed as a co-registration or blend of the first modality with image data of a second modality, according to an embodiment of the present invention;

FIG. 3 is a prior art side-by-side image; and

FIG. 4 is a block diagram of an imaging system according to an embodiment of the present invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Throughout this document, a Region-of-Interest (ROI) is meant to refer to a contractible, and thus a simply connected subset of image pixels within one slice (i.e. a two-dimensional plane)of a total image volume. The smallest ROI is one pixel, and the largest is the entire slice. A Volume-of-Interest (VOI) extends the notion of a ROI to three dimensions, with the smallest unit being a voxel, i.e. a three-dimensional pixel. That is, a VOI is a contractible, and thus simply connected subset of image voxels from the entire image volume in three dimensional space.

The present invention is able to produce blended images from disparate imaging devices, which produce data in different modalities. One advantage of the present invention is the ability to register and/or fuse a portion of a first image volume with a second image volume, without registering and/or fusing the entire image volumes. This is accomplished by allowing ROIs or VOIs to be selected (manually or automatically) for fusion. The selected ROIs or VOIs in one modality can be overlaid (i.e., superimposed) or blended with data from a corresponding ROI or VOI in a second modality.

FIG. 2 shows an exemplary image created by the ROI fusion technique of the present invention. The region of interest in the functional SPECT object image is limited to the area of focal uptake within a specific organ. The relevant information in the anatomical CT target image is distributed throughout the entire image slice. Within a coregistered CT image 200 (composed of multiple slices), an ROI 202 of a single slice of the image 200 corresponding to that specific organ is identified and selected for fusion. As shown, ROI 202 in the CT image is overlaid with the corresponding data from the same ROI in a nuclear SPECT image. That is, the entire image slice of CT image 200 remains intact except for the ROI 202. Thus, through the present invention, a clinician is capable of viewing any desired object image data from any one modality superposed on a target image from another modality, and yet is able to maintain the spatially accurate anatomical image from the second modality as a reference.

For simplicity, FIG. 2 shows the case of two modalities being blended in a single ROI, but the invention is not so limited and can be extended to more than two modalities and any number of ROIs and/or VOIs. For example, a CT image could have one ROI superimposed with ultrasound image data and a second ROI superimposed with nuclear medical image data.

FIG. 4 is a block diagram a system for image fusion of disparate imaging data according to an embodiment of the present invention. System 400 includes an image fusion device 402 coupled with two or more disparate imaging devices (i.e., different modalities), such as MRI unit 404, SPECT unit 406, Ultrasound unit 408, PET unit 410, CT unit 412 and AX (angioplasty X-ray) unit 414. Image fusion device 402 is configured to receive imaging data from each of the disparate imaging devices. Imaging data may be received in a common data protocol, such as the DICOM (Digital Imaging and COmmunication in Medicine) standard.

Image fusion device 402 is configured to process and filter image data as well as to co-register image voxel data received. Image fusion device 402 may include a client interface (not shown) to allow a user of the system to select ROIs and VOIs, display parameters, etc. according to the present invention. Accordingly, image fusion device 402 may include a keyboard or other I/O device 402 a, a color video display monitor or other display device 402 b, and memory 402 c for storing image data, for example.

Image fusion device 402 may be any combination of hardware and software configured to perform the functions of the present invention, and may be in any configuration (central computer, distributed architecture, etc.). In one exemplary embodiment, the system will include a graphical user interface (not shown) which allows a user of the system to view image data, select ROI and VOIs, view blended images, and other functionality described herein.

To create a combined or fused image, such as the image of FIG. 2, image data of two separate modalities (M1 and M2) are co-registered (i.e., their pixel (voxel) data is aligned). In this case, the CT data of the prostate is registered with the SPECT data of the same prostate. The entire image volumes need not be co-registered, but instead, selected ROI(s) or VOI(s) can be co-registered. Depending upon the type of registration performed, it may be more desirable to register only ROIs or only VOIs that are subsets of the entire image volume. For example, in a system using rigid body registration, when an ROI is of an organ that may move during the imaging period, such as the heart, then there is an advantage to registering the ROIs of the heart rather than the entire image.

Once the image data is co-registered, a composite image then can be constructed for the scaled ROI 202 of object data Ml displayed with color table T1, fused with full frame target data M2 displayed with color table T2. Through coregistration, the pixel locations of the ROI of the object M1 are registered and scaled to match the corresponding pixel locations and size of the target M2. Then, the coregistered ROI data of the first image M1 can be superposed with the corresponding ROI of the entire second target image M2. The color tables T1 and T2 can be set and scaled independently. The same alternatives could be used to display the entire first image M1 with a superposition of ROI data from second image M2. Techniques for co-registering image data are known. For example, many registration techniques are described in Maintz, J. B. A., & Viergever, M. A., A Survey of Medical Image Registration, Navigated Brain Surgery (1999), the entire contents of which are incorporated by reference herein.

The ROIs of the two images, M1 and M2, can be blended, and the blended ROI data superposed with the ROI of the partial or entire second image M2. A color table T3 of the ROI in M2 may be set independently from the full frame M2 image, so that the overlying M1 ROI data may appear semi-transparent or opaque, as controlled by the user (e.g., via a client interface).

The source images for M1 and M2 may be independently filtered with similar or different filters prior to display in the composite image.

Each ROI also is capable of showing time-stamped images and allowing for “movement” or flow depiction. For example, a CT image may be combined with an ultrasound image showing blood flow in an organ. The blood flow could be shown in blended ROI on the CT image. Therefore, when an ROI in an image is superimposed with image data from another device, the image data may be streaming in real-time or near real-time.

Image data can be mathematically represented by the following equation: I _(i) =I _(i)({right arrow over (r _(j) )})∀ jε {ROI _(n=1) , . . . , ROI _(n=N)}, which may be mathematically summarized as ${{I = {\sum\limits_{K}^{M}{c_{nk}I_{k}}}};\quad{\forall{j \in \left\{ {{ROI}_{{n = 1},\ldots\quad,}{ROI}_{n = N}} \right\}}}},$ or in general I=f(I₁, . . . , I_(k)), where f is any function, but where each image I_(n), contains one or more ROI_(n), and only the pixels within these ROI are used for the registration, and fusion or in general, and where M modalities are fused, with N ROIs having coefficients c_(nk).

Interactive 3D-move/shape deformation and multi-modality may be displayed in object data ROI. The ROI/NOI may be generated on single modality displays of either image or on the composite image.

The ROI/NOI also may be generated by a combination (i.e. union or intersection) of ROIs/NOIs and/or of separate ROI/NOIs created in either image, or on the composite image. The ROI/NOI may be created, moved or resized by user operations on single modality displays of either image, or on the composite image. The ROI may be a projection of a 3D VOI.

3D ROI embedded maximum intensity projection (MIP) may be generated and displayed (i.e., MIP of M1 over MIP of M2). Through coregistration, the pixel locations of the VOI of M1 are registered and scaled to match the pixel locations and size of M2. A first MIP image is made up of the selected VOI of the object M1. The scaled VOI of M1 replaces the same registered VOI of the target M2. A MIP image of the combined volume is then created. The scaling intensity and color table type of T1 and T2 that are used to create the display pixel appearance of the combined projection can be adjusted independently by the user.

A second or alternate MIP image is then made up of intensity-scaled VOI of M2. The object VOI of M2 is added to the same registered VOI of the target M1. The scaling intensity and color table type of T1 and T2 that are used to create the display pixel appearance of the combined projection can be adjusted independently by the user.

A third or combined MIP image is then made up of intensity-scaled VOI of M1 and M2. Through coregistration, the VOI of M1 is registered and scaled to match the pixel location and size of the target M2. The scaled VOI of M1 is added to a scaled version of the same registered VOI of M2. The combined VOI replaces the same registered VOI of the target M2. A MIP image of the combined volume is then created. The scaling intensity and color table type of the VOI in M1 and the VO1 in M2 that are used to create the combined VOI can be adjusted independently by the user.

Images from modalities M1 and M2 may be segmented into separate organ or shape zones by a segmentation technique. The image correlation and coherence between the co-registered images is reported by zone and segmentation technique.

The present invention can be extended to more than two modalities, where all but one of the modalities represent the object modalities and the remaining one modality represents the target modality, i.e. different ROIs may show various modality information.

The present invention can be extended to multiple image series in two or more modalities, where all of the series in one modality and all but one of the series in the second modality represent the object modality, and the remaining one image in the second modality represents the target modality. In this explanation, series may mean images acquired at different times, in different studies, dynamic images, gated images, or other combinations. Each of the images may be independently registered with the target image.

The present invention can be used for manual and semi-automatic registration, or to either initialize or fine tune auto-registration, where structures within ROIs are used for registration.

One skilled in the art will understand that the present invention can be extended and used for interventional procedures, as well as for Partial Volume Correction.

Thus, a number of preferred embodiments have been fully described above with reference to the drawing figures. Although the invention has been described based upon these preferred embodiments, it would be apparent to those of skilled in the art that certain modifications, variations, and alternative constructions would be apparent, while remaining within the spirit and scope of the invention. 

1. A system for localized image fusion, comprising: an image fusion unit configured to receive target and object imaging data respectively from first and second disparate imaging devices, to register a selected region of interest of said target imaging data with said object imaging data, to blend a first region of interest being a subset of said target imaging data with a second region of interest being a subset of said object imaging data corresponding to said first region of interest, and to display a fused image comprising said target imaging data and the blended first and second regions-of-interest.
 2. The system as recited in claim 1, wherein said target imaging data is anatomical (e.g. CT) data and said object imaging data is functional (e.g. SPECT) data:
 3. The system as recited in claim 1, wherein said image fusion unit is further configured to receive third imaging data from a third imaging device, to blend a third region-of-interest being a subset of said first imaging target data with a fourth region of interest being a subset of said third imaging object data, and to display a fused image comprising said first imaging target data, the blended first and second regions-of-interest, and the blended third and fourth regions-of-interest.
 4. The system as recited in claim 1, wherein said blended first and second regions-of-interest is based on giving said second region-of-interest a display weight higher or independently variable than a weight given said first region-of-interest.
 5. The system as recited in claim 1, wherein said blended first and second regions-of-interest is a weighted sum or combination of said first and second regions-of-interest, said second region-of-interest being given a weight higher or independently variable than a weight given said first region-of-interest.
 6. The system as recited in claim 3, wherein said blended first and second regions-of-interest is a weighted sum or combination of said first and second regions-of-interest, said second region-of-interest being given a weight higher or independently variable than a weight given said first region-of-interest, and said blended third and fourth regions-of-interest is a weighted sum or combination of said third and fourth regions-of-interest, said fourth region-of-interest being given a weight higher or independently variable than a weight given said third region-of-interest.
 7. The system as recited in claim 1, further comprising a color display monitor for displaying said blended image.
 8. A method for image registration and fusion, comprising the steps of: receiving first image data of a first modality; receiving second image data of a second modality different from said first modality; registering at least one portion of said first image data with said second image data; blending a first region-of-interest of said first image data within said at least one portion with a second region-of-interest of said second image data to create a blended region-of-interest; and rendering a combined image comprising said second image data and said blended region-of-interest.
 9. The method according to claim 8, wherein said step of blending comprises a step of summing or combining pixel data of said first region-of-interest with pixel data of said second region-of-interest to form said blended region-of-interest or the display information for the region of interest.
 10. The method according to claim 8, wherein said rendering step includes a step of displaying said first image data with a first color format and said blended region-of-interest in a second color format, said blended region-of-interest being displayed in a location corresponding to said first region-of-interest.
 11. The method according to claim 8, further comprising steps of: selecting the region-of-interest from said first image data, said first region of interest being a subset of said first image data; and selecting the second region-of-interest from said second image data based upon the location of said first region-of-interest, said second region of interest being a subset of said first image data.
 12. A system for localized image fusion, comprising: an image fusion unit configured to receive first and second imaging data respectively from first and second imaging devices, to register said first imaging data with said second imaging data, to blend a first region-of-interest being a subset of said first imaging data with a second region-of-interest being a subset of said second imaging data corresponding to said first region of interest, and to display a fused image comprising said first imaging data and the blended first and second regions-of-interest, and/or to display a fused image comprising said second imaging data and the blended first and second regions-of-interest.
 13. A system for localized image fusion, comprising: an input means for receiving first and second imaging data respectively from first and second imaging devices, a registration means for registering said first imaging data with said second imaging data; a blending means for blending a first region-of-interest being a subset of said first imaging data with a second region-of-interest being a subset of said second imaging data corresponding to said first region of interest; and a display means for displaying a fused image comprising said first imaging data and the blended first and second regions-of-interest.
 14. The system as recited in claim 13, wherein said first imaging data is anatomical data and said second imaging data is functional data.
 15. The system as recited in claim 13, wherein said input means receives third imaging data from a third imaging device, said blending means blends a third region-of-interest being a subset of said first imaging data with a fourth region of interest being a subset of said third imaging data, and said display means displays a fused image comprising said first imaging data, the blended first and second regions-of-interest, and the blended third and fourth regions-of-interest.
 16. The system as recited in claim 13, wherein said blending means weights the combination of said first and second regions-of-interest during said blending.
 17. The system as recited in claim 16, wherein said blended first and second regions-of-interest is a weighted sum of said first and second regions-of-interest, said second region-of-interest being given a weight higher than a weight given said first region-of-interest.
 18. The system as recited in claim 16, wherein said blended first and second regions-of-interest is a weighted sum of said first and second regions-of-interest, said second region-of-interest being given a weight higher than a weight given said first region-of-interest, and said blended third and fourth regions-of-interest is a weighted sum of said third and fourth regions-of-interest, said fourth region-of-interest being given a weight higher than a weight given said third region-of-interest.
 19. An image fusion system, comprising: a plurality of imaging devices, at least two devices of said plurality generating image data from different modalities; an image fusion unit couple with said plurality of images and configured to receive first and second imaging data respectively from first and second imaging devices of said plurality, to register said first imaging data with said second imaging data, to blend a first region-of-interest being a subset of said first imaging data with a second region-of-interest being a subset of said second imaging data corresponding to said first region of interest, and to display a fused image comprising said first imaging data and the blended first and second regions-of-interest, wherein said first and second imaging data are different modalities.
 20. The system as recited in claim 19, wherein said first imaging device comprises a CT scanner and said second imaging device comprises a nuclear imaging device.
 21. The system as recited in claim 19, wherein said image fusion unit is further configured to receive third imaging data from a third imaging device, to blend a third region-of-interest being a subset of said first imaging data with a fourth region of interest being a subset of said third imaging data, and to display a fused image comprising said first imaging data, the blended first and second regions-of-interest, and the blended third and fourth regions-of-interest.
 22. The system as recited in claim 19, wherein said blended first and second regions-of-interest is based on giving said second region-of-interest a weight higher than a weight given said first region-of-interest.
 23. The system as recited in claim 19, wherein said blended first and second regions-of-interest is a weighted sum of said first and second regions-of-interest, said second region-of-interest being given a weight higher than a weight given said first region-of-interest.
 24. The system as recited in claim 21, wherein said blended first and second regions-of-interest is a weighted sum of said first and second regions-of-interest, said second region-of-interest being given a weight higher than a weight given said first region-of-interest, and said blended third and fourth regions-of-interest is a weighted sum of said third and fourth regions-of-interest, said fourth region-of-interest being given a weight higher than a weight given said third region-of-interest.
 25. The system as recited in claim 19, further comprising a color display monitor for displaying said blended image.
 26. The system as recited in claim 19, wherein said first imaging device comprises an Ultrasound device and said second imaging device comprises a nuclear imaging device.
 27. The system as recited in claim 19, wherein said first imaging device comprises a CT device and said second imaging device comprises a SPECT device.
 28. An image fusion device, comprising: an input unit for receiving imaging data from disparate imaging devices; a graphic user interface configure to display said imaging data, to allow selection of regions-of-interest within said imaging data to be blended, and to display a blended image; and a processing unit coupled with said input unit and said graphical user interface and configured to co-register said imaging data from disparate imaging devices, to blend selected regions-of-interest, and to generate display data for a fused image comprising said blended regions-of-interest.
 29. The system as recited in claim 28, wherein said imaging data includes first image data and second image data corresponding to first and second imaging devices respectively; said interface is configured to allow selection of a first region-of-interest (ROI) from said first imaging data and a second region-of-interest (ROI) from said second imaging data; said processing unit is configured to co-register said first imaging data with said second imaging data and to blend said first ROI with said second ROI to create a first blended ROI, and to generate display data comprising said first image data and said first ROI superimposed with the first blended ROI.
 30. The system as recited in claim 29, wherein said first imaging data is CT data and said second imaging data is SPECT data.
 31. The system as recited in claim 29, wherein said imaging data includes third imaging data from a third imaging device; said interface is configured to allow selection of a third ROI from said third imaging data and to select a fourth ROI from said first imaging data; said processing unit is configured to co-register said first imaging data with said third imaging data and to blend said third ROI with said fourth ROI to create a second blended ROI, and to generate display data comprising said first image data, said first ROI superimposed with said first blended ROI, and said fourth ROI superimposed with said second blended ROI.
 32. The system as recited in claim 31, wherein said first and second blended ROIs are generated based upon giving said second region-of-interest data a weight higher than a weight given said first region-of-interest data and giving said third region-of-interest data a higher weight than said fourth region of interest data.
 33. The system as recited in claim 28, wherein said system is configured to display a single image in a first modality having a plurality of ROIs superimposed with data from images of other modalities.
 34. The system as recited in claim 1, wherein said object imaging data is anatomical (e.g. CT) data and said target imaging data is functional (e.g. SPECT) data.
 35. The system as recited in claim 13, wherein said first imaging data is functional data and said second imaging data is anatomical data. 